A multiple imputation procedure of censored values in family-based genetic association studies
نویسندگان
چکیده
Biological quantitative data are subjected to censoring when a portion of values cannot be quantified because they are smaller or greater than the limit of detection (LOD) of the laboratory assay. In genetic association studies of quantitative trait, the handling of censored data has received little attention and often the solutions are unsatisfactory. However, the approach used to deal with such data can have a substantial impact on the results of the analysis. While the Tobit model represents an appropriate method for independent data, there is no evidence on its performance in the presence of non-independent observations, typical of familyor pedigreebased studies. In the context of a family-based study, we propose a Bayesian approach which takes into account the uncertainty of the imputation procedure using several imputations for each censored value. In particular, assuming vague (uninformative) priors for all hyper-parameters, the imputation based on Gibbs sampling is applied to variance-components linear regression models, where the primary outcome is related to a secondary outcome. Through simulation, we describe the behavior of the Tobit model in the presence of different degrees of censoring and heritability of the trait compared with the Bayesian model and the naı̈ve approach of replacing all censored values with the LOD value.
منابع مشابه
Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
متن کاملCombining Survival Analysis Results after Multiple Imputation of Censored Event Times
Multiple Imputation (MI) is an effective and increasingly popular solution in the handling of missing covariate data as well as missing continuous and categorical outcomes in clinical studies. However, in many therapeutic areas, interest has also risen in multiple imputation of censored time-to-event data, because in many cases the Censored at Random (CAR) assumption is not clinically plausible...
متن کاملGoodness-of-fit Tests for Archimedean Copula Models
In this paper, we propose two tests for parametric models belonging to the Archimedean copula family, one for uncensored bivariate data and the other one for right-censored bivariate data. Our test procedures are based on the Fisher transform of the correlation coefficient of a bivariate (U, V ), which is a one-toone transform of the original random pair (T1, T2) that can be modeled by an Archi...
متن کاملNon-iterative, regression-based estimation of haplotype associations with censored survival outcomes.
The general availability of reliable and affordable genotyping technology has enabled genetic association studies to move beyond small case-control studies to large prospective studies. For prospective studies, genetic information can be integrated into the analysis via haplotypes, with focus on their association with a censored survival outcome. We develop non-iterative, regression-based metho...
متن کاملToward improved statistical methods for analyzing Cotinine-Biomarker health association data
BACKGROUND Serum cotinine, a metabolite of nicotine, is frequently used in research as a biomarker of recent tobacco smoke exposure. Historically, secondhand smoke (SHS) research uses suboptimal statistical methods due to censored serum cotinine values, meaning a measurement below the limit of detection (LOD). METHODS We compared commonly used methods for analyzing censored serum cotinine dat...
متن کامل